69 research outputs found

    PENGUJIAN COGNITIVE WALKTHROUGH ANTARMUKA PERPUSTAKAAN DIGITAL (E-LIBRARY) PUSAT DOKUMENTASI DAN INFORMASI ILMIAH ā€“ LIPI (PDII-LIPI)

    Get PDF
    Interface design has a significant role towards the successful of digital librarys application use. Digital libraryservices developed by Centre for Scientific Documentation and Information - Indonesian Institute of Sciences (PDII-LIPI) has yet to be evaluated. This research analyzed the design of the web interface of PDII-LIPIā€™s digitallibraries using the method of cognitive walkthrough (CW). The aim of the research is to identify user constraints inusing PDII-LIPIā€™s digital libraries. Object of this study are three menus in digital library web interface that is ā€œKaryaIlmiah Indonesiaā€, ā€œBuku Elektronikā€, and ā€œJurnal Indonesia (ISJD)ā€. CW testing parameters for PDII-LIPIā€™s digitallibrary interface consists of the successful completion of the task, and the effectiveness of the task. The successfulcompletion of the task was assessed by comparing the standard time with task completion time by respondents. Effectiveness accessed based on the processing time of each stage and the number of mistakes made by therespondent. The test results showed that all respondents successfully completed the task with the time that goesbeyond the standard set time. The analysis was conducted on all test results indicate that the obstacles faced byusers in general are finding menu of ā€œE-Libraryā€, specify the search facility is used, and searching the articles

    Optimization of Spaced K-mer Frequency Feature Extraction using Genetic Algorithms for Metagenome Fragment Classification

    Get PDF
    K-mer frequencies are commonly used in extracting features from metagenome fragments. In spite of this, researchers have found that their use is still inefficient. In this research, a genetic algorithm was employed to find optimally spaced k-mers. These were obtained by generating the possible combinations of match positions and don't care positions (written as *). This approach was adopted from the concept of spaced seeds in PatternHunter. The use of spaced k-mers could reduce the size of the k-mer frequency feature's dimension. To measure the accuracy of the proposed method we used the naƃĀÆve Bayesian classifier (NBC). The result showed that the chromosome 111111110001, representing spaced k-mer model [111 1111 10001], was the best chromosome, with a higher fitness (85.42) than that of the k-mer frequency feature. Moreover, the proposed approach also reduced the feature extraction time.

    Clustering topic groups of documents using K-Means algorithm: Australian Embassy Jakarta media releases 2006-2016

    Get PDF
    Introduction. The Australian Embassy in Jakarta is storing a wide array of media release document. Analyzing particular and vital patterns of the documents collection is imperative as it will result in new insights and knowledge of significant topic groups of the documents.Methodology. K-Means was used algorithm as a non-hierarchical clustering method which partitioning data objects into clusters. The method works through minimizing data variation within cluster and maximizing data variation between clusters.Ā Data Analysis.Ā  Of the documents issued between 2006 and 2016, 839 documents were examined in order to determine term frequencies and to generate clusters. Evaluation was conducted by nominating an expert to validate the cluster result.Results and discussions. The result showed that there were 57 meaningful terms grouped into 3 clusters. ā€œPeople to people linksā€, ā€œeconomic cooperationā€, and ā€œhuman developmentā€ were chosen to represent topics of the Australian Embassy Jakarta media releases from 2006 to 2016.Conclusions. Text mining can be used to cluster topic groups of documents. It provides a more systematic clustering process as the text analysis is conducted through a number of stages with specifically set parameters.Ā 

    Identification of Tuna and Mackerel based on DNA Barcodes using Support Vector Machine

    Get PDF
    Tuna and mackerel are important fish in Indonesia that have great demand in the community and contain good nutrients for health. Many of the processed products have been faked including processed fish, by replacing the content of products that have high sales value to other lower price one. For ensuring food safety, fraudulent should be prevented by identifying the content of refined product. In this research, we implemented support vector machine (SVM), one of the popular methods in machine learning, to yield a model for identifying the content of refined product based on DNA barcode sequences. The feature extraction of DNA barcode Sequences was conducted by calculating k-mers frequency of each sequences. In this study, we used trinucleotide (3-mers) and tetranucleotide (4-mers). These features were inputted to SVM to classify and identify whether the DNA barcode sequences belong to the class of tuna, mackerel, or other fish. The evaluation results showed model SVM was able to perform identification with the accuracy 88%

    SISTEM MANAJEMEN PENGETAHUAN PERLINDUNGAN ANAK (STUDI KASUS: SAKTI PEKSOS DI KEMENTERIAN SOSIAL)

    Get PDF
    In 2014, The Ministry of Social Affairs ofThe Republic of Indonesia (MOSA) send 670 social workers, called Ć¢ā‚¬Å“Sakti PeksosĆ¢ā‚¬ to all provinces in Indonesia. Sakti Peksos, which stands for Ć¢ā‚¬Å“Satuan Bakti Pekerja SosialĆ¢ā‚¬ is the social workers who assist the Child Welfare Program in Indonesia or known as Ć¢ā‚¬Å“PKSAĆ¢ā‚¬. To assist the childrenĆ¢ā‚¬ā„¢s case, the social workers need many practice knowledge of how to deal with the childrenĆ¢ā‚¬ā„¢s issues. Therefore, it is beneficial to manage the existing information by developing knowledge management system. Through the application based web program, this knowledge management system will develop all the Sakti PeksosĆ¢ā‚¬ā„¢s knowledge. This web aims at organizing all the informations that they have so that it can be shared to other Sakti Peksos. The purpose of this study was to develop a Child Protection Knowledge Management System for Sakti Peksos in the Ministry of Social Affairs. This research uses Knowledge Management System Life Cycle methodology which is adopted from award and ghaziri methodology. This research involves four steps; gathering the knowledge, designing the brueprint of knowledge management system, helding verification as well as validation process, and implementing knowledge management system. The result of this research is a web based child protection knowledge management system, equipped with CakePHP framework and MySQL as Relational Database Management System (RDBMS)

    Klasifikasi Kanker Tumor Payudara Menggunakan Arsitektur Inception-V3 Dan Algoritma Machine Learning

    Get PDF
    Breast cancer is a disease that arises due to breast tissue cells that grow abnormally and continuously. This disease is a disease with a large increase in number of around 13 million per year, with a mortality rate of 9.6% from a total of 65,858 cases. Early detection of breast cancer for prevention needs to be done, with the hope that breast cancer is easier to treat and cure and can even be prevented before it enters an advanced stage. In this research, build a model with transfer learning technique for breast cancer classification. There are 4 methods tested, namely Inception-V3 feature extraction with the Radial Basic Function Neural Network classification method, FeedForward Neural Network, Logistic Regression and feature extraction by making changes to the hyperparameter layer. This study compares the four models to get the best one to solve the problem of breast cancer classification. The data used in this study are breast cancer image data with a zoom scale of 40X, 100X, 200X and 400X. The dataset was sourced from The Laboratory University of Parana with P&D Laboratory Pathological Anatomy and Cytopathology, Parana, Brazil. The results of this study indicate that the Inception-V3 feature extraction method with the Logistic Regression classification method on the 40X zoom scale data provides the best accuracy (93.00%), precision (94.00%), and recall (91.00%) F1-score (92.00%)

    Cluster Analysis for SME Risk Analysis Documents Based on Pillar K-Means

    Get PDF
    In Small Medium Enterpriseā€™s (SME) financing risk analysis, the implementation of qualitative model by giving opinion regarding business risk is to overcome the subjectivity in quantitative model. However, there is another problem that the decision makers have difficulity to quantify the riskā€™s weight that delivered through those opinions. Thus, we focused on three objectives to overcome the problems that oftenly occur in qualitative model implementation. First, we modelled risk clusters using K-Means clustering, optimized by Pillar Algorithm to get the optimum number of clusters. Secondly, we performed risk measurement by calculating term-importance scores using TF-IDF combined with term-sentiment scores based on SentiWordNet 3.0 for Bahasa Indonesia. Eventually, we summarized the result by correlating the featured terms in each cluster with the 5Cs Credit Criteria. The result shows that the model is effective to group and measure the level of the risk and can be used as a basis for the decision makers in approving the loan proposal.

    Performance Comparison of Data Sampling Techniques to Handle Imbalanced Class on Prediction of Compound-Protein Interaction

    Get PDF
    The prediction of Compound-Protein Interactions (CPI) is an essential step in the drug-target analysis for developing new drugs as well as for drug repositioning. One challenging issue in this field is that commonly there are more numbers of non-interacting compound-protein pairs than interacting pairs. This problem causes bias, which may degrade the prediction of CPI. Besides, currently, there is not much research on CPI prediction that compares data sampling techniques to handle the class imbalance problem. To address this issue, we compare four data sampling techniques, namely Random Under-sampling (RUS), Combination of Over-Under-sampling (COUS), Synthetic Minority Over-sampling Technique (SMOTE), and Tomek Link (T-Link). The benchmark CPI data: Nuclear Receptor and G-Protein Coupled Receptor (GPCR) are used to test these techniques. Area Under Curve (AUC) applied to evaluate the CPI prediction performance of each technique. Results show that the AUC values for RUS, COUS, SMOTE, and T-Link are 0.75, 0.77, 0.85 and 0.79 respectively on Nuclear Receptor data and 0.70, 0.85, 0.91 and 0.72 respectively on GPCR data. These results indicate that SMOTE has the highest AUC values. Furthermore, we found that the SMOTE technique is more capable of handling class imbalance problems on CPI prediction compared to the remaining three other techniques

    Deep learning optimization for drug-target interaction prediction in COVID-19 using graphic processing unit

    Get PDF
    The exponentially increasing bioinformatics data raised a new problem: the computation time length. The amount of data that needs to be processed is not matched by an increase in hardware performance, so it burdens researchers on computation time, especially on drug-target interaction prediction, where the computational complexity is exponential. One of the focuses of high-performance computing research is the utilization of the graphics processing unit (GPU) to perform multiple computations in parallel. This study aims to see how well the GPU performs when used for deep learning problems to predict drug-target interactions. This study used the gold-standard data in drug-target interaction (DTI) and the coronavirus disease (COVID-19) dataset. The stages of this research are data acquisition, data preprocessing, model building, hyperparameter tuning, performance evaluation and COVID-19 dataset testing. The results of this study indicate that the use of GPU in deep learning models can speed up the training process by 100 times. In addition, the hyperparameter tuning process is also greatly helped by the presence of the GPU because it can make the process up to 55 times faster. When tested using the COVID-19 dataset, the model showed good performance with 76% accuracy, 74% F-measure and aĀ speed-up value of 179

    Pengkontruksian Bidirected Overlap Graph untuk Perakitan Sekuens DNA

    Get PDF
    De novo DNA (Deoxyribonucleic Acid) sequence assembly atau perakitan sekuens DNA secara De novo adalah tahapan yang sangat penting dalam analisis sekuens DNA. Tahapan ini diperlukan untuk merakit atau menyambungkan kembali fragmen-fragmen DNA (reads) yang dihasilkan oleh Next Generation Sequencing menjadi genom yang utuh. Masalah perakitan DNA ini dapat direpresentasikan sebagai masalah Shortest Common Superstring (SCS). Perakitan ini memerlukan bantuan perangkat lunak untuk mendeteksi daerah yang sama pada reads DNA (overlap), mengkonstruksi overlap graph, dan kemudian mencari shortest path dari graf yang terbentuk. Metode ini dinamakan Overlap Layout Consensus (OLC). Hal yang penting dalam metode OLC adalah pendeteksian overlap dari masing-masing reads. Pada penelitian ini dikembangkan suatu teknik untuk membuat bidirected overlap graph. Suffix array digunakan untuk menentukan bagian overlap dari setiap reads dengan melakukan pengindeksan setiap suffix dari reads. Proses perakitan sekuens DNA merupakan suatu proses komputasi yang intensif. Untuk mengefisiensikanĀ  proses dilakukan perubahan masing-masing suffix dan prefix menjadi suatu nilai tertentu yang bersifat tunggal dan mencari overlap dengan membandingkan angka yang merupakan representasi dari setiap reads. Cara ini lebih efisien dibandingkan melakukan pendeteksian overlap dengan metode pencocokan Ā string. Hasil perbandingan menunjukkan bahwa waktu yang diperlukan untuk mengeksekusi metode yang diusulkan (perbandingan angka) jauh lebih singkat dibandingkan dengan menggunakan metode pencocokan string. Untuk jumlah reads 2000 dan 5000 reads teknik yang diusulkan ini dapat menghasilkan overlap graph yang 100% akurat di mana semua reads dapat direpresentasikan ke dalam node yang dikonrtruksi dan semua overlap dapat direpresentasikan ke dalam edge.Ā AbstractDe novo DNA sequence assembly is the important step in DNA sequence analysis. This step is required for assembling fragments or reads produced by Next Generation Sequencing to yield a whole genome. The problem of DNA assembly could be represented as the Shortest Common Superstring (SCS) problem. The assembly requires a software for detecting the overlap region among reads, constructing an overlap graph, and finding the shortest path from the overlap graph.. This method is popular as The Overlap Layout Consensus (OLC). The most important step in OLC is detecting overlaps among reads. This study develop a new approach to construct bidirected overlap graph. Suffis array is used for detecting overlap region from each reads by indexing suffix of each reads. DNA assembly process is computational intensive. To reduce the execution time suffix and prefix was converted into the single value so that the detection of overlap could be done by comparing the values. This method is much more efficient compared to that of using string matching. Using 2000 and 5000 reads, the proposed method (value comparison) could yield the perfect overlap graph, in which all reads and overlap could be represented as nodes and edges, respectively.Ā 
    • ā€¦
    corecore